Archival Repositories for Digital Libraries

نویسنده

  • Jennifer Widom
چکیده

Current libraries use an assortment of uncoordinated and unreliable techniques for storing and managing their digital information. Digital information can be lost for a variety of reasons: magnetic decay, format and device obsolescence, human or system error, among many others. In this thesis, we address the problem of how to build archival repositories (AR). An AR has the following combined key requirements that distinguish it from other repositories. First, digital objects (e.g., documents, technical reports, movies) must be preserved indefinitely, as technologies, file formats, and organizations evolve. Second, ARs will be formed by a confederation of independent organizations. Third, published digital objects have a historical nature, so changes should not be done in-place; instead, they should be recorded in versions. We provide an architecture for ARs that assures long-term archival storage of digital objects. This assurance guarantee is achieved by having a federation of independent but collaborating sites, each managing a collection of digital objects. We also provide a framework for evaluating the reliability and cost of an AR. Finally, we present techniques for efficient access of documents in a federation of independent sites.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Archival Repositories for Digital Libraries Extended

This paper studies the archival problem: how a digital library can preserve electronic documents over long periods of time. We analyze how an archival repository can fail and we present diierent strategies that help solve the problem. We introduce ArchSim, a simulation tool that for evaluating an implementation of an archival repository system and compare options such as diierent disk reliabili...

متن کامل

Modeling Archival Repositories for Digital Libraries

This paper studies the archival problem: how a digital library can preserve electronic documents over long periods of time. We analyze how an archival repository can fail and we present different strategies that help solve the problem. We introduce ArchSim, a simulation tool that for evaluating an implementation of an archival repository system and compare options such as different disk reliabi...

متن کامل

A Unique Arrangement: Organizing Collections for Digital Libraries, Archives, and Repositories

Digital libraries increasingly host collections that are archival in nature, and contain digitized and born-digital materials. In order to preserve the evidentiary value of these materials, the collection organization must capture the general context and preserve the relationships among objects. Archival processing is a well-established method for organizing collections this way. However, the c...

متن کامل

MANENT: An Infrastructure for Integrating, Structuring and Searching Digital Libraries

Digital Libraries represent the commitment of research communities to preserve authoritative and well structured sources of knowledge, and to share archival organisations, methods and resources thanks to systems relying on standard metadata formats. This chapter describes some natural language processing techniques exploited for automatically extracting structural information about documents st...

متن کامل

Risk management foundations for digital libraries: DRAMBORA (Digital Repository Audit Method Based on Risk Assessment)

This paper proposes the use of the DRAMBORA (Digital Repository Audit Method Based on Risk Assessment), the Digital Curation Centre (DCC) and DigitalPreservationEurope (DPE) audit toolkit for digital repositories, as a tool to ensure the preservation capabilities of digital libraries. Digital repositories lie at the heart of digital libraries: ensuring long-term sustainability of their content ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003